AITopics | causal imitation learning

environmentinvariantenvironment specific

Neural Information Processing SystemsFeb-7-2026, 19:24:10 GMT

artificial intelligence, latexit sha1, machine learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Greater London > London (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Causal Imitation Learning With Unobserved Confounders

Neural Information Processing SystemsDec-24-2025, 07:08:40 GMT

One of the common ways children learn is by mimicking adults. Imitation learning focuses on learning policies with suitable performance from demonstrations generated by an expert, with an unspecified performance measure, and unobserved reward signal. Popular methods for imitation learning start by either directly mimicking the behavior policy of an expert (behavior cloning) or by learning a reward function that prioritizes observed expert trajectories (inverse reinforcement learning). However, these methods rely on the assumption that covariates used by the expert to determine her/his actions are fully observed. In this paper, we relax this assumption and study imitation learning when sensory inputs of the learner and the expert differ. First, we provide a non-parametric, graphical criterion that is complete (both necessary and sufficient) for determining the feasibility of imitation from the combinations of demonstration data and qualitative assumptions about the underlying environment, represented in the form of a causal model. We then show that when such a criterion does not hold, imitation could still be feasible by exploiting quantitative knowledge of the expert trajectories. Finally, we develop an efficient procedure for learning the imitating policy from experts' trajectories.

causal imitation learning, name change, unobserved confounder, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.60)

Add feedback

Causal Imitation Learning with Unobserved Confounders

Neural Information Processing SystemsAug-15-2025, 02:50:41 GMT

One of the common ways children learn is by mimicking adults.

imitation, policy space, trajectory, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.28)
North America > United States > Oregon > Benton County > Corvallis (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(7 more...)

Genre: Research Report (0.46)

Industry:

Information Technology (0.67)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.69)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.68)

Add feedback

Review for NeurIPS paper: Causal Imitation Learning With Unobserved Confounders

Neural Information Processing SystemsJan-26-2025, 13:25:10 GMT

Weaknesses: Generally speaking, I have several doubts in the current paper. In the paper the authors explored the problem of imitation learning, based on a very restrictive assumption that the causal diagram is given. This is highly impossible in most real world applications, which renders the proposed approach not that useful on the practical side. Even if we assume that we know the causal models on the observed variables, how could we know how many hidden confounders exist among them and how hidden confounders influence among them? Apparently, hidden confounders play an extremely essential role in the proposed method in this paper.

causal imitation learning, imitation surrogate, unobserved confounder, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.96)
Information Technology > Artificial Intelligence > Robots (0.68)

Add feedback

Review for NeurIPS paper: Causal Imitation Learning With Unobserved Confounders

Neural Information Processing SystemsJan-26-2025, 13:25:03 GMT

The paper consideras a very general setting with possible unobserved confounders, expert and policy can have different inputs and the reward being unobserved. The work presents multiple criteria for ensuring successful imitation in particular based on proxy variables for task rewards.

assumption, causal imitation learning, imitation, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Robots (0.72)

Add feedback

Causal Imitation Learning With Unobserved Confounders

Neural Information Processing SystemsOct-10-2024, 18:43:02 GMT

One of the common ways children learn is by mimicking adults. Imitation learning focuses on learning policies with suitable performance from demonstrations generated by an expert, with an unspecified performance measure, and unobserved reward signal. Popular methods for imitation learning start by either directly mimicking the behavior policy of an expert (behavior cloning) or by learning a reward function that prioritizes observed expert trajectories (inverse reinforcement learning). However, these methods rely on the assumption that covariates used by the expert to determine her/his actions are fully observed. In this paper, we relax this assumption and study imitation learning when sensory inputs of the learner and the expert differ. First, we provide a non-parametric, graphical criterion that is complete (both necessary and sufficient) for determining the feasibility of imitation from the combinations of demonstration data and qualitative assumptions about the underlying environment, represented in the form of a causal model.

causal imitation learning, trajectory, unobserved confounder, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.63)

Add feedback

Causal Imitation Learning with Unobserved Confounders

#artificialintelligenceAug-15-2022, 00:11:38 GMT

One of the common ways children learn is by mimicking adults. Imitation learning focuses on learning policies with suitable performance from demonstrations generated by an expert, with an unspecified performance measure, and unobserved reward signal. Popular methods for imitation learning start by either directly mimicking the behavior policy of an expert (behavior cloning) or by learning a reward function that prioritizes observed expert trajectories (inverse reinforcement learning). However, these methods rely on the assumption that covariates used by the expert to determine her/his actions are fully observed. In this paper, we relax this assumption and study imitation learning when sensory inputs of the learner and the expert differ. First, we provide a non-parametric, graphical criterion that is complete (both necessary and sufficient) for determining the feasibility of imitation from the combinations of demonstration data and qualitative assumptions about the underlying environment, represented in the form of a causal model. We then show that when such a criterion does not hold, imitation could still be feasible by exploiting quantitative knowledge of the expert trajectories. Finally, we develop an efficient procedure for learning the imitating policy from experts' trajectories.

causal imitation learning, unobserved confounder

#artificialintelligence

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.53)

Add feedback

Filters

Collaborating Authors

causal imitation learning

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

environmentinvariantenvironment specific

Causal Imitation Learning With Unobserved Confounders

Causal Imitation Learning with Unobserved Confounders

Review for NeurIPS paper: Causal Imitation Learning With Unobserved Confounders

Review for NeurIPS paper: Causal Imitation Learning With Unobserved Confounders

Causal Imitation Learning With Unobserved Confounders

Causal Imitation Learning with Unobserved Confounders